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2 ' According to Wigner theorem, transformations of quantum states which preserve 

^ , the probabihties are either unitary or antiunitary. This short communication presents 

^^' an elementary proof of this theorem that significantly departs from the numerous ones 

already existing in the literature. The main line of the argument remains valid even 
in quantum field theory where Hilbert spaces are non-separable. 

t^ ■ 1 Preliminaries 

m ■ 

Wigner theorem is a cornerstone of theoretical physics since it encapsulates all the linear 
^T ' structure of quantum transformations, among which the evolution of quantum systems 

^r^ . (aside from the measurement process). More precisely, given a Hilbert space H endowed 

with a Hermitian product (x|'0) Wigner, in the early 30's(1959, Appendix to chap. 20, 
pp 233-236, for the updated English translation), acknowledged that any transforma- 



'k> ■ tion T : ip \-^ IV')'^ = T(|^)) such that 



.=?.: V(|x),|V'))g^', r(xlV')" I = I (xIV') I (1) 

is either 

(a) linear r(ci|V'i) +C2|V'2)) = (ci|'0i)'^ + C2\ip2V) and unitary T^^ = T*; 
or 

(b) antilinear T(ci|'0i) + C2\ip2)) = c*|'0i)^ + C2\tp2)'^ and unitary T^^ = T* (such a 
map is also called antiunitary). 

We shall systematically use the usual Dirac bra-ket notation and, in the above de- 
finitions, (ci,C2) stands for any pair of complex coefficients, (|'0i), IV'2)) is any pair of 



*mouchet@lmpt. univ-tours.fr 



elements of Ti. The Hermitian conjugate will be denoted by ( )* and therefore, if z is just 
one complex number, z* stands for its complex conjugate. 

Since Wigner's original work that pertained to the representation of the rotation group, 
many proofs and generalizations have been proposed whose levels of rigour are not neces- 
sarily correlated to their length but, rather, vary depending on the concern of their author. 
See for example (Simon et al., 2008) and references therein to which I shall add (Fonda 
& Ghirardi, 1970, § 1.3.2) and the concise and elegant proof given by Messiah (1959, 
§ XV-2). The present work is further added to this list because it appears to be almost 
a back-of-the-envelope presentation while keeping a level of rigour that is acceptable by 
physicists (hopefully, little additionnal work on the main key ideas should meet to the 
requirements of mathematicians as well). Moreover, the majority of the previous proofs, if 
not all of them, can hardly be transposed to non-separable Hilbert spaces, that is to spaces 
where a countable orthonormal basis does not exist. Yet, in quantum field theory, such 
non-separable Hilbert spaces are unavoidable: any continuous canonical transformation 
or rearrangement of the infinitely many degrees of freedom — that physically describe 
a renormalisation of the bare particles into the dressed ones like, for instance, when a 
condensation occurs — requires that the Hilbert space is made of a continuous family of 
orthogonal Fock spaces and one cannot content one self with the unique Fock space that 
represents the physical (dressed) particles; see for example Emch (1972, §1.1) or Umezawa 
(1993, chap. 3). 

The proof presented in the second part of the present article will at first make use of 
a (at most) countable basis {\^u))^^j in T~l-, 

ifulff,) = Suf, , (2) 

but the discreteness of the set I will be just a matter of convenience. The transposition to a 
continuous (multi-)index is straightforward (we will not use any induction arguments that 
would prevent such generalization) and physicists are quite used to it. For instance, with 
an appropriate choice of normalization, the Kronecker symbol in ([2|) is replaced by a Dirac 
distribution, the discrete sequences z = {z^)^^j labelled by i^ become regular functions z : 
v 1-^ z{v), functions of z become functionals, matrices are turned into operators, etc. 

Unlike in many proofs, we shall not suppose a priori that T is bijective. The only 
additional property will be that T is diff'erentiable twice (just once may be sufficient but we 
will not try no minimize the requirements on the regularity of the transformation) . Let us 
just mention that such smoothness is a reasonable supposition based on physical grounds. 
Except during the measuring process, when the number of degrees of freedom involved in 
the interaction of the system with a measuring device becomes infinite — and we know 
that the superposition principle as well as condition ([T|) are lost by the transformation on 
the states induced by the measuring process — , we have never observed any discontinuity 
nor singularity with respect to the quantum state in a transformation. 

To see how this differentiable hypothesis simplifies considerably the proof of Wigner 
theorem by offering a simple strategy, let us see how this works in the Euclidean case 
where the analogous of the Wigner theorem is known as Mazur-Ulam theorem(1932, this 
first version was published the following year after Wigner's). Let us suppose that T is 



a differentiable application in an Euclidean space E preserving the scalar product "•" i.e. 
such that 

V(M,7;)Gf^ T{u)-T(v) = u-v . (3) 

Then, differentiating ([3D with respect to u and v leads to 

^{u,v)^E\ {T'{u)fT\v) = \ (4) 

(" * " denotes the transposition and " ' " the derivative) which shows immediately that 
the Jacobian matrix T'[v) is invertible, independent of v and indeed orthogonal. Since 
moreover T(0) = (the scalar product is non degenerate), we have proven that T is 
necessary a linear orthogonal transformation (it is obviously a sufficient condition). 

Under the alternative hypothesis of being surjective, the Mazur-Ulam theorem is also 
immediate since, then, the orthonormal basis {ev)v&i is mapped to an orthonormal ba- 
sis {e^)uei on which, for any v in £, we can expand v'^ and then. 
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Y.^el-v-)el = Y,{e,-v)el. (5) 



If there were a non-null vector w va £ that were orthogonal to the subspace spanned 
by (e^)j^g7, then its preimage T~-'^(u)), whose existence is guaranteed by the surjectivity 
hypothesis, would be the null vector, being a vector orthogonal to any Cp. This is in 
contradiction with T(0) = 0. Relation (0) can be written as 



( ^{(^v ■v)e^\ = ^(e^ • v) T{t 



(6) 



which expresses the linearity of T whose orthogonality follows. However, in the Her- 
mitian case, the possible phase factors that may appear in the relation between '^{ipi,\ip) 
and {ipy\ip) make the proofs following the second line of thought much less straightforward. 

2 The core of the proof 

The invariance property formulated in terms of complex analysis: We shall take 
advantage of the isomorphism that allows us to identify the Hilbert space % to the set 
of sequences of complex numbers z = [zu)uei- More precisely, once given an orthonomal 
basis {\^v))y^p we can map any \iIj) mT-Lio z with z^, = {ipy\ilj). The Hermitian product 
in % reads 

{w\z) = w* z = ^wl z^ = {x\tlj) (7) 



with z = {{'^u\'^))^^^j and w = {{'fu\x))yizj- ^^y application T in H can be seen 
as a either a function T{x,y) of the sequence of the real part x = Rez and imaginary 
part y = Im z of the components of the state, or rather as a function of two independent 
complex sequences z and z 

T{z,z)''^'r{{-z + z)/2M-z-z)/2) (8) 



evaluated at z = z* . When T can be differentiated, we can define the derivatives with 
respect to the complex variables by (Dubrovine et al., 1984, § 12.1, for instance) 

dzT ^^ \{^d,-idy)T; (9a) 
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dzT ''^ -{d, + ;dy)T. (9b) 



In particular, T is analytic if and only if d^T = 0. From the differential of T, we get 

d,,[T{z,z*)) = {d,T){z,z*). (10) 

In the following we will drop the distinction between z* and the variable z that may vary 
independently of z because the continuation of any application T{z^ z) defined by dH) in 
the domain where z ^ z* \s unique. 

In this language, the invariance condition ([T|) can be reformulated as follows, 

V(u;, z) G n^, I {T{w, w*))* T{z, z*)\ = \w* z\ . (11) 

Therefore we have two possibilities : there exists a real function 6 of the complex vari- 
ables {WjW* , z, z*) such that 

(a) either {T{w,w*)y T{z, z*) = e'^^'"''"*''''*^w* z ; (12a) 

(b) or {T{z, z*))* T{w, w*) = ^-'''^(^''"''''''^w* z . (12b) 

To understand where these two conditions come from, first divide the both sides of the 
equality in (jlip by \w* z\ when non vanishing (then eventually include these cases by 
continuity of the transformation T). Thus, we are led to an equation of the form \Z\ = 1 
where Z is a complex number. Then, the argument -q oi Z can take the two solutions 
±arccos(ReZ) in [— vr, vr] of the equation cosr/ = ReZ. The two options (a) and (b) 
correspond to the two forms e±^--^-^^^°<^^z) ^^^^ \s Z = q^^'^^^^^^^z) ^^ ^* ^ giarccos{ReZ)^ 

Permuting {w,w*) and {z,z*), then by conjugation we have necessarily 

9{z,z\w,w*) = -9iw,w*,z,z*) (13) 

which incidentally shows that 0(0,0,0,0) = 0. 

Adjusting the phase: The first step of our proof is to redefine the phase of the trans- 
formed states by considering 

f{z,z*) = e'"'-''''^T{z,z*) (14) 

where a is a real function. Conditions (1121) can be written 



(a) either {T{w,w*)y T{z,z*) = e'^^'"''"*^'^''^w* z ; (15a) 

(b) or {f{z,z*)yf{w,w*) = e-'^'''"''"'''''*^w*z. (15b) 



with 

0{w, w* , z, z*) = 9{w, w* , z, z*) + a{z, z*) — a{w, w*) . (16) 

The latter identity can be used to cancel 9(0, 0, z, z*) with the choice 

a{z, z*) = -6(0, 0, z, z*) = e{z, z*, 0, 0) . (17) 

Therefore without loss of generality, in the following we can consider that for all z 

0(O,O,z,z*) = O. (18) 

Differentiation : The second step of the proof is to differentiate the conditions written 
in (fT2]) with respect to w* and z. From ()12ap . using du)*{T{w,w*)) = [dwT{w,w*)) , we 
get 

{d^Tiw,w*)yd,Tiz,z*) = e^^K-*'-.-*) [l+ 

+ \w*dw*6{w,w* , z, z*) + idzO{w,'w* , z, z*)z 
+ (idl^*6{w,w*,z,z*) — d^0{w,w*,z,z*) d^*9{w,w*,z,z*)^ w* z . (19) 

Because of this last computation, and in particular because of the presence of the Hes- 
sian of 0, we assume that the transformation is differentiable twice. When evaluated 
for {w,w*) = (0,0), we get 

(a,r(o,o))*a,r(z,z*) = i (20) 

since 1^ implies 8^0(0,0, z,z*) = 0. Then {dzT{z, z*))'^ = (a^r(0,0))* appears to be 
independent of {z,z*) and unitary. Moreover, because T(0, 0) = 0, we have, for all z, 

T{z,z*) = d,T{0,0)z (21) 

and therefore z >-^ z"^ = T(z,z*) is linear. The linear operator U defined by its matrix 
elements (<^y|C/ !<?!';/')= (92r(0, 0))^,^, is such that U~^ = U* and the expression (f2T]l can be 
written as {ipu\ip)'^ = J^u'i^J^l^l^i^') ifi^'l'^') = {^u\U\tp), hence \ip)'^ = C/|V') which is the 
first option offered by Wigner theorem. 

The second option comes when differentiating ()12bp which just modifies the left-hand 
side of (|19p and the irrelevant sign in front of 9 in the right-hand side. Then we obtain 



(a,.r(o,o))*a,*r(z,z*) = i (22) 

and eventually 

T{z,z*) = dz*T{0,0) z* . (23) 

The transformation z i— ?• z'^ = T{z,z*) is antilinear. An antilinear operator A can be de- 
fined from its matrix elements {(j}i,\[A\(/),j'))= {dz*T{0,0))j^^, and we have 

{4>u\i^V = Eu'{<t>u\ (i|0.')) {<t>u'\i^)* = {4>u\ (iE.' \<P.') i^u'li^)) = {4>u\ (ijV')) that is 

IV')^ = A\il;). The relation (a^*r(0,0))"^ = (a^.r(0,0))* reads i^^ = A*. Therefore 
case (b) implies the second alternative of Wigner theorem. 
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